Overview
Brought to you by YData
Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 11495243 |
| Missing cells | 862 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.1 GiB |
| Average record size in memory | 569.4 B |
Variable types
| Text | 7 |
|---|---|
| Categorical | 1 |
| Unsupported | 1 |
Reproduction
| Analysis started | 2025-03-04 04:26:04.218505 |
|---|---|
| Analysis finished | 2025-03-04 04:32:08.296218 |
| Duration | 6 minutes and 4.08 seconds |
| Software version | ydata-profiling vv4.12.2 |
| Download configuration | config.json |
Variables
tconst
Text
Unique 
| Distinct | 11495243 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 729.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.5099734 |
| Min length | 9 |
Unique
| Unique | 11495243 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0000001 |
|---|---|
| 2nd row | tt0000002 |
| 3rd row | tt0000003 |
| 4th row | tt0000004 |
| 5th row | tt0000005 |
| Value | Count | Frequency (%) |
| tt0000019 | 1 | < 0.1% |
| tt9916880 | 1 | < 0.1% |
| tt0000001 | 1 | < 0.1% |
| tt0000002 | 1 | < 0.1% |
| tt0000003 | 1 | < 0.1% |
| tt0000004 | 1 | < 0.1% |
| tt0000005 | 1 | < 0.1% |
| tt0000006 | 1 | < 0.1% |
| tt0000007 | 1 | < 0.1% |
| tt0000008 | 1 | < 0.1% |
| Other values (11495233) | 11495233 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 22990486 | |
| 1 | 11177578 | |
| 2 | 10594685 | |
| 0 | 9381525 | |
| 4 | 8731344 | 8.0% |
| 3 | 8659163 | 7.9% |
| 8 | 8401135 | 7.7% |
| 6 | 8375193 | 7.7% |
| 5 | 7260022 | 6.6% |
| 7 | 6940354 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 109319455 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 22990486 | |
| 1 | 11177578 | |
| 2 | 10594685 | |
| 0 | 9381525 | |
| 4 | 8731344 | 8.0% |
| 3 | 8659163 | 7.9% |
| 8 | 8401135 | 7.7% |
| 6 | 8375193 | 7.7% |
| 5 | 7260022 | 6.6% |
| 7 | 6940354 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 109319455 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 22990486 | |
| 1 | 11177578 | |
| 2 | 10594685 | |
| 0 | 9381525 | |
| 4 | 8731344 | 8.0% |
| 3 | 8659163 | 7.9% |
| 8 | 8401135 | 7.7% |
| 6 | 8375193 | 7.7% |
| 5 | 7260022 | 6.6% |
| 7 | 6940354 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 109319455 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 22990486 | |
| 1 | 11177578 | |
| 2 | 10594685 | |
| 0 | 9381525 | |
| 4 | 8731344 | 8.0% |
| 3 | 8659163 | 7.9% |
| 8 | 8401135 | 7.7% |
| 6 | 8375193 | 7.7% |
| 5 | 7260022 | 6.6% |
| 7 | 6940354 | 6.3% |
titleType
Categorical
Imbalance 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 715.3 MiB |
| tvEpisode | |
|---|---|
| short | |
| movie | 708001 |
| video | 306953 |
| tvSeries | 277952 |
| Other values (6) | 314647 |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 8.2459039 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | short |
|---|---|
| 2nd row | short |
| 3rd row | short |
| 4th row | short |
| 5th row | short |
Common Values
| Value | Count | Frequency (%) |
| tvEpisode | 8840212 | |
| short | 1047478 | 9.1% |
| movie | 708001 | 6.2% |
| video | 306953 | 2.7% |
| tvSeries | 277952 | 2.4% |
| tvMovie | 150118 | 1.3% |
| tvMiniSeries | 60182 | 0.5% |
| tvSpecial | 51593 | 0.4% |
| videoGame | 42180 | 0.4% |
| tvShort | 10573 | 0.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| tvepisode | 8840212 | |
| short | 1047478 | 9.1% |
| movie | 708001 | 6.2% |
| video | 306953 | 2.7% |
| tvseries | 277952 | 2.4% |
| tvmovie | 150118 | 1.3% |
| tvminiseries | 60182 | 0.5% |
| tvspecial | 51593 | 0.4% |
| videogame | 42180 | 0.4% |
| tvshort | 10573 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 11105516 | |
| e | 10817505 | |
| v | 10597883 | |
| i | 10557556 | |
| t | 10448683 | |
| s | 10225824 | |
| d | 9189345 | |
| p | 8891805 | |
| E | 8840212 | |
| r | 1396185 | 1.5% |
| Other values (10) | 2718155 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 94788669 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 11105516 | |
| e | 10817505 | |
| v | 10597883 | |
| i | 10557556 | |
| t | 10448683 | |
| s | 10225824 | |
| d | 9189345 | |
| p | 8891805 | |
| E | 8840212 | |
| r | 1396185 | 1.5% |
| Other values (10) | 2718155 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 94788669 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 11105516 | |
| e | 10817505 | |
| v | 10597883 | |
| i | 10557556 | |
| t | 10448683 | |
| s | 10225824 | |
| d | 9189345 | |
| p | 8891805 | |
| E | 8840212 | |
| r | 1396185 | 1.5% |
| Other values (10) | 2718155 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 94788669 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 11105516 | |
| e | 10817505 | |
| v | 10597883 | |
| i | 10557556 | |
| t | 10448683 | |
| s | 10225824 | |
| d | 9189345 | |
| p | 8891805 | |
| E | 8840212 | |
| r | 1396185 | 1.5% |
| Other values (10) | 2718155 | 2.9% |
primaryTitle
Text
| Distinct | 5168903 |
|---|---|
| Distinct (%) | 45.0% |
| Missing | 19 |
| Missing (%) | < 0.1% |
| Memory size | 869.2 MiB |
Length
| Max length | 458 |
|---|---|
| Median length | 405 |
| Mean length | 19.866783 |
| Min length | 1 |
Unique
| Unique | 4712861 ? |
|---|---|
| Unique (%) | 41.0% |
Sample
| 1st row | Carmencita |
|---|---|
| 2nd row | Le clown et ses chiens |
| 3rd row | Poor Pierrot |
| 4th row | Un bon bock |
| 5th row | Blacksmith Scene |
| Value | Count | Frequency (%) |
| episode | 4829052 | 12.7% |
| the | 1176448 | 3.1% |
| dated | 940667 | 2.5% |
| 459592 | 1.2% | |
| of | 404479 | 1.1% |
| a | 321779 | 0.8% |
| and | 254342 | 0.7% |
| in | 232152 | 0.6% |
| to | 190123 | 0.5% |
| 2 | 150494 | 0.4% |
| Other values (1413762) | 29064003 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26527098 | 11.6% | |
| e | 20058625 | 8.8% |
| i | 13152108 | 5.8% |
| o | 12968983 | 5.7% |
| a | 11415061 | 5.0% |
| s | 11180633 | 4.9% |
| d | 10119771 | 4.4% |
| r | 8221351 | 3.6% |
| t | 8134487 | 3.6% |
| n | 8107950 | 3.6% |
| Other values (193) | 98487054 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 228373121 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 26527098 | 11.6% | |
| e | 20058625 | 8.8% |
| i | 13152108 | 5.8% |
| o | 12968983 | 5.7% |
| a | 11415061 | 5.0% |
| s | 11180633 | 4.9% |
| d | 10119771 | 4.4% |
| r | 8221351 | 3.6% |
| t | 8134487 | 3.6% |
| n | 8107950 | 3.6% |
| Other values (193) | 98487054 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 228373121 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 26527098 | 11.6% | |
| e | 20058625 | 8.8% |
| i | 13152108 | 5.8% |
| o | 12968983 | 5.7% |
| a | 11415061 | 5.0% |
| s | 11180633 | 4.9% |
| d | 10119771 | 4.4% |
| r | 8221351 | 3.6% |
| t | 8134487 | 3.6% |
| n | 8107950 | 3.6% |
| Other values (193) | 98487054 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 228373121 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 26527098 | 11.6% | |
| e | 20058625 | 8.8% |
| i | 13152108 | 5.8% |
| o | 12968983 | 5.7% |
| a | 11415061 | 5.0% |
| s | 11180633 | 4.9% |
| d | 10119771 | 4.4% |
| r | 8221351 | 3.6% |
| t | 8134487 | 3.6% |
| n | 8107950 | 3.6% |
| Other values (193) | 98487054 |
originalTitle
Text
| Distinct | 5193902 |
|---|---|
| Distinct (%) | 45.2% |
| Missing | 19 |
| Missing (%) | < 0.1% |
| Memory size | 870.6 MiB |
Length
| Max length | 458 |
|---|---|
| Median length | 405 |
| Mean length | 19.86421 |
| Min length | 1 |
Unique
| Unique | 4738067 ? |
|---|---|
| Unique (%) | 41.2% |
Sample
| 1st row | Carmencita |
|---|---|
| 2nd row | Le clown et ses chiens |
| 3rd row | Pauvre Pierrot |
| 4th row | Un bon bock |
| 5th row | Blacksmith Scene |
| Value | Count | Frequency (%) |
| episode | 4828988 | 12.7% |
| the | 1123849 | 3.0% |
| dated | 940666 | 2.5% |
| 460601 | 1.2% | |
| of | 383714 | 1.0% |
| a | 313452 | 0.8% |
| and | 247572 | 0.7% |
| in | 226118 | 0.6% |
| to | 187747 | 0.5% |
| de | 151384 | 0.4% |
| Other values (1450636) | 29135816 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26504698 | 11.6% | |
| e | 20002760 | 8.8% |
| i | 13200140 | 5.8% |
| o | 12960285 | 5.7% |
| a | 11496781 | 5.0% |
| s | 11186560 | 4.9% |
| d | 10122621 | 4.4% |
| r | 8197867 | 3.6% |
| n | 8135441 | 3.6% |
| t | 8097489 | 3.5% |
| Other values (180) | 98438897 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 228343539 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 26504698 | 11.6% | |
| e | 20002760 | 8.8% |
| i | 13200140 | 5.8% |
| o | 12960285 | 5.7% |
| a | 11496781 | 5.0% |
| s | 11186560 | 4.9% |
| d | 10122621 | 4.4% |
| r | 8197867 | 3.6% |
| n | 8135441 | 3.6% |
| t | 8097489 | 3.5% |
| Other values (180) | 98438897 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 228343539 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 26504698 | 11.6% | |
| e | 20002760 | 8.8% |
| i | 13200140 | 5.8% |
| o | 12960285 | 5.7% |
| a | 11496781 | 5.0% |
| s | 11186560 | 4.9% |
| d | 10122621 | 4.4% |
| r | 8197867 | 3.6% |
| n | 8135441 | 3.6% |
| t | 8097489 | 3.5% |
| Other values (180) | 98438897 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 228343539 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 26504698 | 11.6% | |
| e | 20002760 | 8.8% |
| i | 13200140 | 5.8% |
| o | 12960285 | 5.7% |
| a | 11496781 | 5.0% |
| s | 11186560 | 4.9% |
| d | 10122621 | 4.4% |
| r | 8197867 | 3.6% |
| n | 8135441 | 3.6% |
| t | 8097489 | 3.5% |
| Other values (180) | 98438897 |
isAdult
Unsupported
Rejected  Unsupported 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 353.8 MiB |
startYear
Text
| Distinct | 152 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 666.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.7520616 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1894 |
|---|---|
| 2nd row | 1892 |
| 3rd row | 1892 |
| 4th row | 1892 |
| 5th row | 1893 |
| Value | Count | Frequency (%) |
| n | 1425056 | 12.4% |
| 2021 | 507635 | 4.4% |
| 2022 | 490335 | 4.3% |
| 2018 | 457527 | 4.0% |
| 2023 | 454328 | 4.0% |
| 2019 | 453913 | 3.9% |
| 2017 | 451143 | 3.9% |
| 2020 | 435600 | 3.8% |
| 2016 | 426978 | 3.7% |
| 2015 | 401704 | 3.5% |
| Other values (142) | 5991024 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 11303024 | |
| 0 | 10420534 | |
| 1 | 7297795 | |
| 9 | 3997466 | 9.3% |
| \ | 1425056 | 3.3% |
| N | 1425056 | 3.3% |
| 8 | 1406881 | 3.3% |
| 7 | 1300194 | 3.0% |
| 3 | 1189419 | 2.8% |
| 4 | 1173229 | 2.7% |
| Other values (2) | 2192206 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43130860 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 11303024 | |
| 0 | 10420534 | |
| 1 | 7297795 | |
| 9 | 3997466 | 9.3% |
| \ | 1425056 | 3.3% |
| N | 1425056 | 3.3% |
| 8 | 1406881 | 3.3% |
| 7 | 1300194 | 3.0% |
| 3 | 1189419 | 2.8% |
| 4 | 1173229 | 2.7% |
| Other values (2) | 2192206 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43130860 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 11303024 | |
| 0 | 10420534 | |
| 1 | 7297795 | |
| 9 | 3997466 | 9.3% |
| \ | 1425056 | 3.3% |
| N | 1425056 | 3.3% |
| 8 | 1406881 | 3.3% |
| 7 | 1300194 | 3.0% |
| 3 | 1189419 | 2.8% |
| 4 | 1173229 | 2.7% |
| Other values (2) | 2192206 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43130860 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 11303024 | |
| 0 | 10420534 | |
| 1 | 7297795 | |
| 9 | 3997466 | 9.3% |
| \ | 1425056 | 3.3% |
| N | 1425056 | 3.3% |
| 8 | 1406881 | 3.3% |
| 7 | 1300194 | 3.0% |
| 3 | 1189419 | 2.8% |
| 4 | 1173229 | 2.7% |
| Other values (2) | 2192206 | 5.1% |
endYear
Text
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 647.1 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.0238279 |
| Min length | 1 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | \N |
|---|---|
| 2nd row | \N |
| 3rd row | \N |
| 4th row | \N |
| 5th row | \N |
| Value | Count | Frequency (%) |
| n | 11358288 | |
| 2019 | 7351 | 0.1% |
| 2018 | 7249 | 0.1% |
| 2017 | 7183 | 0.1% |
| 2020 | 7158 | 0.1% |
| 2021 | 7016 | 0.1% |
| 2022 | 6574 | 0.1% |
| 2023 | 6003 | 0.1% |
| 2016 | 5740 | < 0.1% |
| 2024 | 4965 | < 0.1% |
| Other values (87) | 77716 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| \ | 11358288 | |
| N | 11358288 | |
| 2 | 151696 | 0.7% |
| 0 | 139684 | 0.6% |
| 1 | 97361 | 0.4% |
| 9 | 60014 | 0.3% |
| 8 | 22136 | 0.1% |
| 7 | 18922 | 0.1% |
| 6 | 15484 | 0.1% |
| 3 | 14663 | 0.1% |
| Other values (2) | 27857 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23264393 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| \ | 11358288 | |
| N | 11358288 | |
| 2 | 151696 | 0.7% |
| 0 | 139684 | 0.6% |
| 1 | 97361 | 0.4% |
| 9 | 60014 | 0.3% |
| 8 | 22136 | 0.1% |
| 7 | 18922 | 0.1% |
| 6 | 15484 | 0.1% |
| 3 | 14663 | 0.1% |
| Other values (2) | 27857 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23264393 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| \ | 11358288 | |
| N | 11358288 | |
| 2 | 151696 | 0.7% |
| 0 | 139684 | 0.6% |
| 1 | 97361 | 0.4% |
| 9 | 60014 | 0.3% |
| 8 | 22136 | 0.1% |
| 7 | 18922 | 0.1% |
| 6 | 15484 | 0.1% |
| 3 | 14663 | 0.1% |
| Other values (2) | 27857 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23264393 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| \ | 11358288 | |
| N | 11358288 | |
| 2 | 151696 | 0.7% |
| 0 | 139684 | 0.6% |
| 1 | 97361 | 0.4% |
| 9 | 60014 | 0.3% |
| 8 | 22136 | 0.1% |
| 7 | 18922 | 0.1% |
| 6 | 15484 | 0.1% |
| 3 | 14663 | 0.1% |
| Other values (2) | 27857 | 0.1% |
runtimeMinutes
Text
| Distinct | 958 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 646.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 2 |
| Mean length | 1.9859531 |
| Min length | 1 |
Unique
| Unique | 280 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 12 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| n | 7843462 | |
| 30 | 340014 | 3.0% |
| 60 | 255433 | 2.2% |
| 22 | 198479 | 1.7% |
| 45 | 105179 | 0.9% |
| 15 | 102319 | 0.9% |
| 25 | 82346 | 0.7% |
| 44 | 82268 | 0.7% |
| 23 | 76341 | 0.7% |
| 10 | 76338 | 0.7% |
| Other values (948) | 2333064 | 20.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 7843465 | |
| \ | 7843462 | |
| 2 | 1210004 | 5.3% |
| 0 | 1096238 | 4.8% |
| 1 | 1008186 | 4.4% |
| 3 | 777093 | 3.4% |
| 4 | 767648 | 3.4% |
| 5 | 724935 | 3.2% |
| 6 | 524981 | 2.3% |
| 8 | 371069 | 1.6% |
| Other values (33) | 661932 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22829013 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 7843465 | |
| \ | 7843462 | |
| 2 | 1210004 | 5.3% |
| 0 | 1096238 | 4.8% |
| 1 | 1008186 | 4.4% |
| 3 | 777093 | 3.4% |
| 4 | 767648 | 3.4% |
| 5 | 724935 | 3.2% |
| 6 | 524981 | 2.3% |
| 8 | 371069 | 1.6% |
| Other values (33) | 661932 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22829013 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 7843465 | |
| \ | 7843462 | |
| 2 | 1210004 | 5.3% |
| 0 | 1096238 | 4.8% |
| 1 | 1008186 | 4.4% |
| 3 | 777093 | 3.4% |
| 4 | 767648 | 3.4% |
| 5 | 724935 | 3.2% |
| 6 | 524981 | 2.3% |
| 8 | 371069 | 1.6% |
| Other values (33) | 661932 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22829013 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 7843465 | |
| \ | 7843462 | |
| 2 | 1210004 | 5.3% |
| 0 | 1096238 | 4.8% |
| 1 | 1008186 | 4.4% |
| 3 | 777093 | 3.4% |
| 4 | 767648 | 3.4% |
| 5 | 724935 | 3.2% |
| 6 | 524981 | 2.3% |
| 8 | 371069 | 1.6% |
| Other values (33) | 661932 | 2.9% |
genres
Text
| Distinct | 2385 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 824 |
| Missing (%) | < 0.1% |
| Memory size | 744.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.943463 |
| Min length | 2 |
Unique
| Unique | 212 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Documentary,Short |
|---|---|
| 2nd row | Animation,Short |
| 3rd row | Animation,Comedy,Romance |
| 4th row | Animation,Short |
| 5th row | Short |
| Value | Count | Frequency (%) |
| drama | 1298543 | 11.3% |
| comedy | 751676 | 6.5% |
| talk-show | 724994 | 6.3% |
| news | 598531 | 5.2% |
| documentary | 554411 | 4.8% |
| drama,romance | 524295 | 4.6% |
| n | 506003 | 4.4% |
| reality-tv | 366213 | 3.2% |
| adult | 314423 | 2.7% |
| news,talk-show | 260224 | 2.3% |
| Other values (2375) | 5595106 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13323990 | 10.6% |
| m | 9981059 | 7.9% |
| o | 9574172 | 7.6% |
| r | 8416267 | 6.7% |
| e | 8411950 | 6.7% |
| , | 6838538 | 5.4% |
| y | 5832618 | 4.6% |
| t | 5792858 | 4.6% |
| i | 4831394 | 3.8% |
| n | 4506612 | 3.6% |
| Other values (27) | 48279288 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 125788746 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 13323990 | 10.6% |
| m | 9981059 | 7.9% |
| o | 9574172 | 7.6% |
| r | 8416267 | 6.7% |
| e | 8411950 | 6.7% |
| , | 6838538 | 5.4% |
| y | 5832618 | 4.6% |
| t | 5792858 | 4.6% |
| i | 4831394 | 3.8% |
| n | 4506612 | 3.6% |
| Other values (27) | 48279288 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 125788746 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 13323990 | 10.6% |
| m | 9981059 | 7.9% |
| o | 9574172 | 7.6% |
| r | 8416267 | 6.7% |
| e | 8411950 | 6.7% |
| , | 6838538 | 5.4% |
| y | 5832618 | 4.6% |
| t | 5792858 | 4.6% |
| i | 4831394 | 3.8% |
| n | 4506612 | 3.6% |
| Other values (27) | 48279288 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 125788746 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 13323990 | 10.6% |
| m | 9981059 | 7.9% |
| o | 9574172 | 7.6% |
| r | 8416267 | 6.7% |
| e | 8411950 | 6.7% |
| , | 6838538 | 5.4% |
| y | 5832618 | 4.6% |
| t | 5792858 | 4.6% |
| i | 4831394 | 3.8% |
| n | 4506612 | 3.6% |
| Other values (27) | 48279288 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Sample
| tconst | titleType | primaryTitle | originalTitle | isAdult | startYear | endYear | runtimeMinutes | genres | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | tt0000001 | short | Carmencita | Carmencita | 0 | 1894 | \N | 1 | Documentary,Short |
| 1 | tt0000002 | short | Le clown et ses chiens | Le clown et ses chiens | 0 | 1892 | \N | 5 | Animation,Short |
| 2 | tt0000003 | short | Poor Pierrot | Pauvre Pierrot | 0 | 1892 | \N | 5 | Animation,Comedy,Romance |
| 3 | tt0000004 | short | Un bon bock | Un bon bock | 0 | 1892 | \N | 12 | Animation,Short |
| 4 | tt0000005 | short | Blacksmith Scene | Blacksmith Scene | 0 | 1893 | \N | 1 | Short |
| 5 | tt0000006 | short | Chinese Opium Den | Chinese Opium Den | 0 | 1894 | \N | 1 | Short |
| 6 | tt0000007 | short | Corbett and Courtney Before the Kinetograph | Corbett and Courtney Before the Kinetograph | 0 | 1894 | \N | 1 | Short,Sport |
| 7 | tt0000008 | short | Edison Kinetoscopic Record of a Sneeze | Edison Kinetoscopic Record of a Sneeze | 0 | 1894 | \N | 1 | Documentary,Short |
| 8 | tt0000009 | movie | Miss Jerry | Miss Jerry | 0 | 1894 | \N | 45 | Romance |
| 9 | tt0000010 | short | Leaving the Factory | La sortie de l'usine Lumière à Lyon | 0 | 1895 | \N | 1 | Documentary,Short |
| tconst | titleType | primaryTitle | originalTitle | isAdult | startYear | endYear | runtimeMinutes | genres | |
|---|---|---|---|---|---|---|---|---|---|
| 11495233 | tt9916838 | tvEpisode | Episode #3.13 | Episode #3.13 | 0 | 2009 | \N | \N | Drama |
| 11495234 | tt9916840 | tvEpisode | Horrid Henry's Comic Caper | Horrid Henry's Comic Caper | 0 | 2014 | \N | 11 | Adventure,Animation,Comedy |
| 11495235 | tt9916842 | tvEpisode | Episode #3.16 | Episode #3.16 | 0 | 2009 | \N | \N | Drama |
| 11495236 | tt9916844 | tvEpisode | Episode #3.15 | Episode #3.15 | 0 | 2009 | \N | \N | Drama |
| 11495237 | tt9916846 | tvEpisode | Episode #3.18 | Episode #3.18 | 0 | 2009 | \N | \N | Drama |
| 11495238 | tt9916848 | tvEpisode | Episode #3.17 | Episode #3.17 | 0 | 2009 | \N | \N | Drama |
| 11495239 | tt9916850 | tvEpisode | Episode #3.19 | Episode #3.19 | 0 | 2010 | \N | \N | Drama |
| 11495240 | tt9916852 | tvEpisode | Episode #3.20 | Episode #3.20 | 0 | 2010 | \N | \N | Drama |
| 11495241 | tt9916856 | short | The Wind | The Wind | 0 | 2015 | \N | 27 | Short |
| 11495242 | tt9916880 | tvEpisode | Horrid Henry Knows It All | Horrid Henry Knows It All | 0 | 2014 | \N | 10 | Adventure,Animation,Comedy |